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CAGCGTCAGACGCAGGGCACTGAGAATGTGCGACAGCGCGCAACGATGAAGTAGCCCAGAGGGTCCCTTG 
GAAAATGAGGCCAGGGTCCCTGCTGCTGCTTGTTCTGCTGCTCGCCCTGTCCAGGAGCCTGCGGGGCAAA 
GAGTGTGCGTCTCCACCCTGTGAGTGTCACCAGGAGGACGACTTCAGAGTCACCTGCAAGGAGCTCCACC 
GAATCCCCAGCCTGCCGCCCAGCACCCAGACTCTGAAGCTCATCGAGACTCATCTGAAGACCATACCCAG 
TCTTGCATTTTCGAGTCTGCCCAATATTTCCAGGATCTATTTATCTATAGATGCAACTCTGCAGCGGCTG 
GAACCACATTCTTTCTACAATTTGAGTAAAATGACTCACATAGAAATCCGGAACACCAGAAGCTTAACCT 
ATATAGACCCTGATGCCTTGACAGAGCTCCCCTTGCTCAAGTTTCTTGGCATTTTCAATACTGGACTTAG 
AATATTCCCTGACTTGACCAAAATTTATTCCACGGACATATTCTTTATACTTGAAATCACAGACAACCCT 
TACATGACTTCGGTCCCTGAAAACGCATTCCAGGGCCTATGCAATGAAACCTTGACCCTGAAACTGTACA 
ACAATGGATTTACTTCAGTCCAAGGACATGCTTTCAATGGAACAAAGCTGGATGCTGTTTACCTAAACAA 
GAATAAATACCTGACAGCTATAGACAACGATGCCTTTGGAGGAGTATACAGTGGACCAACTTTGCTAGAT 
GTGTCTTCCACCAGCGTCACTGCCCTTCCTTCCAAAGGCCTGGAGCACCTCAAAGAACTGATCGCAAAAG 
ACACCTGGACTCTCAAAAAGCTCCCGCTGTCGTTGAGTTTCCTCCACCTCACTCGGGCTGACCTCTCTTA 
CCCGAGCCACTGCTGCGCTTTTAAGAACCAGAAGAAAATCAGGGGAATCCTGGAGTCTTTGATGTGTAAT 
G AG AGC AGT ATC C GGAACCTTC GTC AAAGGAAATC AGTG AAC ATC TTG AGGGGTC C CATC T ACC AGGAAT 
ATGAAGAAGATCCGGGTGACAACAGTGTTGGGTACAAACAAAACTCCAAGTTCCAGGAGAGCCCAAGCAA 
CTCTCACTATTACGTCTTCTTTGAAGAACAAGAGGATGAGGTCGTTGGTTTCGGCCAAGAGCTCAAAAAT 
CCTCAGGAAGAGACTCTCCAAGCCTTCGAGAGCCACTATGACTACACGGTGTGTGGGGACAACGAGGACA 

TGGTGTGT AC C C CC AAGTC GGAC GAGTTTAACCCC TGTGAAG AT ATC ATGGGC T AC AGGTTC C TG AG AAT 
CGTGGTGTGGTTTGTCAGTCTGCTGGCTCTCCTGGGCAATATCTTCGTCCTGCTCATTCTGCTAA.ee AGC 
CACTACAAATTGACCGTGCCGCGGTTCCTCATGTGCAACTTGGCCTTTGCAGATTTCTGCATGGGGGTAT 
ACCTGCTTCTCATTGCCTCTGTAGACCTGTACACACACTC TGAGTACTACAACC ACGCCATCGACTGGCA 
GACGGGCCCTGGGTGCAACACGGCTGGCTTCTTCACTGTTTTCGCCAGTGAGTTATCAGTGTACACACTG 
ACGGTCATCACCCTGGAGCGATGGTACGCCATCACCTTCGCCATGCGCCTGGATAGGAAGATCCGCCTCA 
GGCACGCGTACACCATCATGGCTGGGGGCTGGGTTTCCTGCTTCCTTCTCGCCCTGCTCCCGATGGTGGG 
AATCAGCAGCTATGCCAAGGTCAGCATCTGCCTGCCAATGGACACCGACACCCCTCTTGCACTCGCATAC 
ATTGTCCTCGTTCTGCTGCTCAATGTTGTTGCCTTTGTTGTCGTCTGTTCCTGCTATGTGAAGATCTACA 
TCACGGTCCGAAATCCCCAGTACAACCCTCGAGATAAAGACACCAAGATTGCCAAGAGGATGGCTGTGTT 
GATCTTCACTGACTTCATGTGCATGGCGCCCATCTCCTTCTATGCGCTGTCGGCACTTATGAACAAGCCT 
CTAATCACTGTTACTAACTCCAAAATCTTGTTGGTTCTCTTCTACCCCCTCAACTCCTGTGCCAATCCGT 
TTCTCTATGCTATTTTCACCAAGGCCTTCCAGAGGGACGTGTTCATCCTGCTCAGCAAGTTTGGCATCTG 
CAAACGCCAGGCCCAGGCCTATCAGGGTCAGAGAGTCTGTCCCAACAATAGCACTGGTATTCAGATCCAA 
AAGATTCCCCAGGACACGAGGCAGAGTCTCCCCAACATGCAAGATACCTATGAACTGCTTGGAAACTCCC 
AGCTAGCTCCAAAACTGCAGGGACAAATCTCAGAAGAGTATAAGCAAACAGCCTTGTAAAGGAAAGGCTA 
CGCTAGTCACAGTGAGACTTACAAAAGGCTGGTTTCTTGAACATGCGTTCCAGTCCCGTGACATGTGAAC 
ACAT AGGTTC ATGCAGGTGATGATTCATAGGGTCAGAGTTCATCTCTAGAAAGTATTGCCTC 
(SEQ ID N0:1) 
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MRPGSLLLLVLLLALSRSLRGKECASPPCECHQEDDFRVTCKELHRIPSLPPSTQTLKLIETHLKTIPSLAFSSLPN 
ISRIYLSIDATLQRLEPHSFYNLSKMTHIEIRNTRSLTYIDPDALTELPLLKFLGIFNTGLRIFPDLTKIYSTDIFF 
ILEITDNPYMTSVPENAFQGLCNETLTLKLYNNGFTSVQGHAFNGTKLDAVYLNKNKYLTAIDNDAFGGVYSGPTLL 
DVSSTSVTALPSKGLEHLKELIAKDTWTLKKLPLSLSFLHLTRADLSYPSHCCAFKNQKKIRGILESLMCNESSIRN 
LRQRKSVNILRGPIYQEYEEDPGDNSVGYKQNSKFQESPSNSHYYVFFEEQEDEWGFGQELKNPQEETLQAFESHY 
DYTVCGDNEDMVCTPKSDEFNPCEDIMGYRFLRIWWFVSLLALLGNIFVLLILLTSHYKLTVPRFLMCNLAFADFC 
MGVYLLLIASVDLYTHSEYYNHAIDWQTGPGCNTAGFFTVFASEL1SVYTLTVITLERWYAITFAMRL1DRKIRLRHAY 
TIMAGGWVSCFLLALLPMVGISSYAKVSICLPMDTDTPLALAYIVLVLLLNWAFVWCSCYVKIYITVRNPQYNPR 
DKDTKIAKRMAVLIFTDFMCMAPISFYALSALMNKPLITVTNSKILLVLFYPLNSCANPFLYAIFTKAFQRDVFILL 
SKFGICKRQAQAYQGQRVCPNNSTGIQIQKIPQDTRQSLPNMQDTYELLGNSQLAPKLQGQISEEYKQTAL 
(SEQ ID NO: 2 ) 
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underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



[CAGCGTCAGACGCAGGGCACTGAGAATGTGCGACAGCGCGCAACGATGAAGTAGCCCAG 

ar;nOTPrrTT(':r:a^aATCARC:rrRflr!CTf'rr 1 TGCTGCTGCTT GTTCTGCTGCTCGCCCT 

GTCCAGGAGCCTGCGGGGCAAAGAGTGTGCGTCTCCACCCTGTGA [GTGTCACCAGGAGG 

ACGACTTCAGAGTCACCTGCAAGGAGCTCCACCGAATCCCCAGCCTGCCGCCCAGCACCC 

AGACTCT ] GAAGCTCATCGAGACTCATCTGAAGACCATACCCAGTCTTGCATTTTCGAGT 

CTGCCCAATATTTCCAGGATCTATTTATCTATAGATGCAACTCTGCAGCGGCTGGAACCA 

CATTCTTTCTACAATTTGAGTAAAATGACTCACATAGAAATCCGGAACACCAGAAGCTTA 

ACCTATATAGACCCTGATGCCTTGACAGAGCTCCCCTTGCTCAAGTTTCTTGGCATTTTC 

AATACTGGACTTAGAATATTCCCTGACTTGACCAAAATTTATTCCACGGACATATTCTTT 

ATACTTGAAATCACAGACAACCCTTACATGACTTCGGTCCCTGAAAACGCATTCCAGGGC 

CTATGCAATGAAACCTTGACCCTGAAACTGTACAACAATGGATTTACTTCAGTCCAAGGA 

CATGCTTTCAATGGAACAAAGCTGGATGCTGTTTACCTAAACAAGAATAAATACCTGACA 

GC T AT AG AC AAC G ATGC C TTTGG AGG AGT ATAC AGTGG ACC AAC TTTGCT AG ATGTGTC T 

TCCACCAGCGTCACTGCCCTTCCTTCCAAAGGCCTGGAGCACCTCAAAGAACTGATCGCA 

AAAGACACCTGGACTCTCAAAAAGCTCCCGCTGTCGTTGAGTTTCCTCCACCTCACTCGG 

GCTGACCTCTCTTACCCGAGCCACTGCTGCGCTTTTAAGAACCAGAAGAAAATCAGGGGA 

ATCCTGGAGTCTTTGATGTGTAATGAGAGCAGTATCCGGAACCTTCGTCAAAGGAAATCA 

GTGAACATCTTGAGGGGTCCCATCTACCAGGAATATGAAGAAGATCCGGGTGACAACAGT 

GTTGGGTACAAACAAAACTCCAAGTTCCAGGAGAGCCCAAGCAACTCTCACTATTACGTC 

TTCTTTGAAGAACAAGAGGATGAGGTCGTTGGTTTCGGCCAAGAGCTCAAAAATCCTCAG 

GAAGAGACTCTCCAAGCCTTCGAGAGCCACTATGACTACACGGTGTGTGGGGACAACGAG 

GACATGGTGTGTACCCCCAAGTCGGACGAGTTTAACCCCTGTGAAGATATCATGGGCTAC 

AGGTTCCTGAGAATCGTGGTGTGGTTTGTCAGTCTGCTGGCTCTCCTGGGCAATATCTTC 

GTCCTGCTCATTCTGCTAACCAGCCACTACAAATTGACCGTGCCGCGGTTCCTCATGTGC 

AACTTGGCCTTTGCAGATTTCTGCATGGGGGTATACCTGCTTCTCATTGCCTCTGTAGAC 

CTGTACACACACTCTGAGTACTACAACCACGCCATCGACTGGCAGACGGGCCCTGGGTGC 

AACACGGCTGGCTTCTTCACTGTTTTCGCCAGTGAGTTATCAGTGTACACACTGACGGTC 

ATCACCCTGGAGCGATGGTACGCCATCACCTTCGCCATGCGCCTGGATAGGAAGATCCGC 

CTCAGGCACGCGTACACCATCATGGCTGGGGGCTGGGTTTCCTGCTTCCTTCTCGCCCTG 

CTCCCGATGGTGGGAATCAGCAGCTATGCCAAGGTCAGCATCTGCCTGCCAATGGACACC 

GACACCCCTCTTGCACTCGCATACATTGTCCTCGTTCTGCTGCTCAATGTTGTTGCCTTT 

GTTGTCGTCTGTTCCTGCTATGTGAAGATCTACATCACGGTCCGAAATCCCCAGTACAAC 

CCTCGAGATAAAGACACCAAGATTGCCAAGAGGATGGCTGTGTTGATCTTCACTGACTTC 

ATGTGCATGGCGCCCATCTCCTTCTATGCGCTGTCGGCACTTATGAACAAGCCTCTAATC 

ACTGTTACTAACTCCAAAATCTTGTTGGTTCTCTTCTACCCCCTCAACTCCTGTGCCAAT 

CCGTTTCTCTATGCTATTTTCACCAAGGCCTTCCAGAGGGACGTGTTCATCCTGCTCAGC 

AAGTTTGGCATCTGCAAACGCCAGGCCCAGGCCTATCAGGGTCAGAGAGTCTGTCCCAAC 

AATAGCACTGGTATTCAGATCCAAAAGATTCCCCAGGACACGAGGCAGAGTCTCCCCAAC 

ATGCAAGATACCTATGAACTGCTTGGAAACTCCCAGCTAGCTCCAAAACTGCAGGGACAA 

ATCTCAGAAGAGTATAAGCAAACAGCCTTGTAAAGGAAAGGCTACGCTAGTCACAGTGAG 

ACTTACAAAAGGCTGGTTTCTTGAACATGCGTTCCAGTCCCGTGACATGTGAACACATAG 

GTTCATGCAGGTGATGATTCATAGGGTCAGAGTTCATCTCTAGAAAGTATTGCCTC 
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Gene Sequence Structure 



91 bp 



Sequence Deleted 



163 bp 



Size of full-length 
cDNA: 2512 bp 



Targeting Vector* 
(genomic sequence) 



Arm Length: 
5": 5 kb 
3 1 : 1.3 kb 



5' arm 



LacZ-Neo 
Cassette 



3' arm 




Taig.eting Vectoi 
Endogenous Locus 



* Not drawn to scale 



5 ' >ACTTGAGAGCCTCTCCTTCCC 
CCTCTCCAGCGTGCTCTCCAGCGA 
TGAGGTCACAGCCCCTCGGAGCCC 
TCCTCCTCCCTCCCTTCCCCTCCT 
GCACCCGGGTCTCTTCCAGCGTCA 
GACGCAGGGCACTGAGAATGTGGC 
GACAGCGCGCAACGATGAAGTAGC 
C C AGAGGGTC CC TTGGAAAATGAG 
GCCAGGGTCCCO ' 
(SEQ ID NO: 3) 



5 1 > GTGTC AC C AGG AGG ACGAC TT 
CAGAGTCACCTGCAAGGAGCTCCA 
CCGAATCCCCAGCCTGCCGCCCAG 
CACCCAGACTCTGTGAGTAGCCAA 
GGCCAAGACCCCCCCCCCCCGAGA 
AATTC GTGGTGTGTGTTGGGGTGT 
GCGCGGATATCTGGTCAGTCCCTG 
TACAAATTCAATCCCCCATGCTCG 
GGAAGGTC AGC < 3 ' 
(SEQ ID NO: 4) 
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